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Structure of aldose reductase from Giardia lamblia 



Giardia lamblia is an anaerobic aerotolerant eukaryotic parasite of the 
intestines. It is believed to have diverged early from eukarya during evolution 
and is thus lacking in many of the typical eukaryotic organelles and biochemical 
pathways. Most conspicuously, mitochondria and the associated machinery of 
oxidative phosphorylation are absent; instead, energy is derived from substrate- 
level phosphorylation. Here, the 1.75 A resolution crystal structure of G. lamblia 
aldose reductase heterologously expressed in Escherichia coli is reported. As 
in other oxidoreductases, G. lamblia aldose reductase adopts a TIM-barrel 
conformation with the NADP + -binding site located within the eight /3-strands of 
the interior. 



1. Introduction 

1.1. Giardia lamblia 

G lamblia is the etiological agent of giardiasis, a common cause 
of diarrheal disease in the developing world. It is an anaerobic 
aerotolerant eukaryotic parasite of the lower intestine. Giardia sp. 
are believed to be basal eukaryotes, with a minimized genome and 
simplified metabolic pathways (Morrison et al., 2007). Mitochondria 
and the machinery of oxidative phosphorylation are absent, with 
energy being derived from substrate-level phosphorylation. Carbo- 
hydrate metabolism is largely limited to D-glucose (Lindmark, 1980), 
while other sugars, sugar alcohols, Krebs-cycle intermediates and 
organic acids have not been found to be utilized. Glucose is oxidized 
to ethanol and acetate as the fermentation end products (Lindmark, 
1980; Jarroll et al., 1981; Schofield et al., 1991), with ethanol being the 
dominant product under anaerobic conditions and acetate predomi- 
nating under aerobic conditions. When cultured in glucose-depleted 
media, Giardia grow slowly, achieve lower cell densities and secrete 
little ethanol (Schofield et al., 1990). While glucose appears to be 
the preferred carbon source, it is not however essential for Giardia 
growth. In the absence of glucose the carbon skeletons of amino acids 
are used as a substitute, with alanine secreted as the waste product. 
Ethanol production is largely absent under glucose-limited condi- 
tions. Alcohol dehydrogenase (ADH), while present in Giardia, does 
not appear to play a role in the production of the ethanol waste 
product. Cultures grown in the presence of pyrazole, an inhibitor 
of ADH, show no reduction in ethanol production. Instead, it is 
believed that ethanol is produced by the reduction of acetaldehyde by 
aldose reductase in an NADPH-dependent reaction. Giardia grown 
in the presence of sodium valproate, an inhibitor of aldose reductase, 
show diminished ethanol production and limited growth (Schofield et 
al., 1991). 

1.2. Aldose reductase 

Aldose reductases (EC 1.1.1.21) are ubiquitous NADPH-depen- 
dent oxidoreductases that catalyze the reduction of aldehydes to 
alcohols. As the name suggests, the aldehyde substrates have been 
classically characterized as aldose sugars; however, in recent years 
there has been an increased appreciation of the potential of these 
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enzymes to detoxify small aldehydes. Typical aldose reductases 
exhibit the TIM-barrel fold with eight a-helices on the exterior and 
an eight-stranded /3-barrel in the interior. Binding of the NADPH 
cofactor is facilitated by a deep elliptical pocket near the C-terminal 
end of the ^-barrel (Borhani et al, 1992; Wilson et al, 1992; Rondeau 
et al, 1992). Mutagenic studies have shown that tight cofactor binding 
is dependent on a conserved arginine residue (Arg269 in PDB entry 
3kbr; Bohren et al, 2005). The active site consists of two distinct 
pockets: a rigid anion-binding pocket and a flexible hydrophobic 
specificity pocket. A third region near the C-terminus has been shown 
to change position upon binding certain ligands (Urzhumtsev et al, 
1997; Kinoshita et al, 2002; Podjarny et al, 2004). Aldose reductase is 
common to both prokaryotes and eukaryotes, with TIM-barrel folds 
being reported from a diversity of species. Aldose reductase functions 
by a sequential ordered mechanism (Kubiseski et al, 1992; Grimshaw 
et al, 1995), with NADPH being bound before substrate. Activity 
involves a stereospecific transfer of the 4-pio-R hydride from 
NADPH to the substrate carbonyl C atom followed by the proton- 
ation of the substrate carbonyl O atom by a conserved tyrosine 
residue (Tyr40 in 3kbr). Proton transfer in the active site is assisted by 
a hydrogen bond between the active tyrosine and the £-amino group 
of a conserved lysine (Lys71 in 3kbr), which is itself linked by a salt 
bridge to a conserved aspartic acid (Asp35 in 3kbr). The substrate is 
oriented in the active site by a conserved histidine residue (Hisl04 in 
3kbr) (Del Corso et al., 2008). 

Owing to its role in human disease, human aldose reductase has 
been the subject of over thirty years of intense research. In diabetic 
hyperglycemia, the hexokinase of insulin-independent tissues 
becomes saturated with glucose, causing excess glucose to instead be 
shunted into the polyol pathway, where aldose reductase converts it to 
sorbitol. Disease results from sorbitol-induced hyperosmotic swelling 
and oxidative stress owing to the lowered glutathione concentration 
caused by the depletion of cellular NADPH reserves. A large number 
of inhibitors have been identified that bind the active site and many 
crystal structures of mammalian aldose reductases with bound inhi- 
bitors have been solved. Unfortunately, despite these efforts, very few 
of these compounds have made it past clinical trials (Del Corso et al, 
2008). 



2. Methods 

2.1. Protein expression and purification 

Full-length aldose reductase (Morrison et al., 2007) from G. lamblia 
ATCC 50803 was cloned into pAVA0421 vector (Alexandrov et al., 
2004) by ligand-independent cloning (LIC; Aslanidis & de Jong, 
1990) to produce a construct with an N-terminal hexahistidine tag 
that is cleavable with 3C protease (the entire tag sequence was 
MAHHHHHHMGTLEAQTQ'GPGS-ORF, in which the 3C clea- 
vage site is marked by a prime). Protein was expressed in Escherichia 
coli BL21 (DE3) cells in 2 1 auto-induction medium (Studier, 2005) in 
a LEX bioreactor (Harbinger, Markham, Ontario, Canada) at 293 K 
for 72 h, after which the harvested cells were flash-frozen in liquid 
nitrogen. The frozen cell pellet was thawed and resuspended by 
vortexing in 200 ml lysis buffer [20 mM HEPES pH 7.4, 300 mM 
NaCl, 5% glycerol, 30 mM imidazole, 0.5% CHAPS, 10 mM MgCl 2 , 
3 mM /3-mercaptoethanol, 1.3 mg ml -1 protease-inhibitor cocktail 
(Roche, Basel, Switzerland) and 0.05 mg ml -1 lysozyme]. The cell 
suspension was packed on ice and disrupted by sonication for 15 min 
in 5 s pulses at 70% amplitude using a Branson 450D Sonifier 
(Branson Ultrasonics, Danbury, Connecticut, USA). The lysate was 
incubated with 20 ul Benzonase nuclease (EMD Chemicals, 



Table 1 

Data-collection statistics. 



Values in parentheses are for the highest resolution shell. 



Beamline 


ALS 5.0.1 


Wavelength (A) 


0.9774 


Space group 


CI 


Unit-cell parameters (A, °) 


a = 196.77, b = 66.09, c = 56.29, 


Resolution range (A) 


= 92.26 


50-1.75 (1.80-1.75) 


Mean Ilo(I) 


17.0 (2.5) 


^merget 


0.073 (0.558) 


Completeness (%) 


99.9 (99.8) 


Multiplicity 


4.6 (4.1) 


No. of unique reflections 


72824 (5374) 



t /Urge = Em/ Ei mm - d(hki))\/j: hkl e^o- 



Gibbstown, New Jersey, USA) for 40 min at room temperature under 
gentle agitation. The lysate was clarified by centrifugation with a 
Sorvall RC5 at 10 000 rev min" 1 for 60 min at 277 K in a F14S rotor 
(Thermo Fisher, Waltham, Massachusetts, USA). The clarified solu- 
tion was syringe-filtered through a 0.45 um cellulose acetate filter 
(Corning Life Sciences, Lowell, Massachusetts, USA). The tagged 
protein was purified by affinity chromatography using a HisTrap FF 
5 ml column (GE Biosciences, Piscataway, New Jersey, USA) equi- 
librated in binding buffer (25 mM HEPES pH 7.0, 300 mM NaCl, 
5% glycerol, 30 mM imidazole, 1 mM DTT) and eluted with 500 mM 
imidazole in the same buffer. To cleave the N-terminal affinity tag, 
peak fractions were pooled and assayed for concentration by 280 nm 
spectrophotometry; 3C protease (Alexandrov et al., 2001) was mixed 
with the target in a 1:50 ratio and dialyzed overnight at 277 K against 
cleavage buffer (20 mM HEPES pH 7.6, 200 mM NaCl, 5% glycerol, 
1 mM DTT). Uncleaved target, protease and cleaved tag were 
removed by a second round of affinity chromatography using a 5 ml 
HisTrap column. The flowthrough and wash from secondary affinity 
chromatography were pooled and concentrated using an Amicon 
Ultra-15 30 kDa molecular-weight cutoff concentrator (Millipore, 
Billerica, Massachusetts, USA). The concentrated sample was further 
purified by size-exclusion chromatography (SEC) using a Superdex 
75 26/60 column (GE Biosciences) equilibrated in SEC buffer 
(20 mM HEPES pH 7.0, 300 mM NaCl, 5% glycerol and 2 mM DTT) 
attached to an AKTAprime plus FPLC system (GE Biosciences). 
Peak fractions were collected and assessed for purity by SDS-PAGE 
on 4-12% NuPAGE gels (Invitrogen, Carlsbad, California, USA) 
with Coomassie staining using SimplyBlue Safestain (Invitrogen). 
Pure fractions were pooled, concentrated to 23 mg ml -1 and flash- 
frozen in liquid nitrogen. The final concentration was determined by 
280 nm spectrophotometry and the final purity was assayed by SDS- 
PAGE. 

2.2. Crystallization 

With the purified protein at a concentration of 25.2 mg ml -1 in 
SEC buffer, two sparse-matrix screens were set up using JCSG+ 
(Emerald BioSystems, Bainbridge Island, Washington, USA), PACT 
(Molecular Dimensions, Suffolk, England), Index and Crystal Screen 
(Hampton Research, Aliso Viejo, California, USA) following an 
extended Newman's strategy (Newman et al., 2005). 0.4 ul protein 
solution was mixed with 0.4 ul well solution and equilibrated against 
100 ul reservoir using 96- well Compact Jr crystallization plates from 
Emerald BioSystems. Crystals were found under several PEG con- 
ditions, while crystals suitable for X-ray diffraction experiments were 
obtained from PACT condition H6: 20% PEG 3350, 200 mM sodium 
formate, 100 mM Bis-Tris propane. The crystals were cryoprotected 
by soaking them in a buffer consisting of 25 % ethylene glycol mixed 
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Table 2 

Refinement and model statistics. 



oeamlme 


at e cni 


Resolution range (A.) 


50-1.75 


-^crystt 


0.144 


Afreet 


0.173 


R.m.s.d. bonds (A) 


0.016 


R.m.s.d. angles (°) 


1.56 


Protein atoms 


4908 


Nonprotein atoms 


767 


Mean B factor (A 2 ) 


16.7 


Residues in favoured region 


580 (97.2%) 


Residues in allowed region 


17 (2.9%) 


Residues in disallowed region 


0 (0%) 


MolProbityX score (percentile) 


1.13 (99th) 


PDB code 


3krb 



t # cryst = Y,hki 1 1 F obs I ~ I ^caic 1 1 / Em/ I F obs I ■ The f r ee R factor was calculated using 5 % of 
the reflections omitted from the refinement (Winn et al, 2011). $ Chen et al (2010). 

with reservoir solution. The crystals were vitrified by plunging them 
directly into liquid nitrogen. 



scores. The model was then iteratively extended manually using Coot 
(Emsley & Cowtan, 2004) followed by cycles of reciprocal-space 
refinement with the CCPA program REFMAC5 (Murshudov et al, 
2011). The final model was validated with the validation tools within 
Coot and with MolProbity (Chen et al, 2010). 

The final model was refined at 1.75 A resolution to i? wor k = 0.144 
and Rf X&e = 0.173 with good stereochemistry (see Table 2). The 
observed structure extends from residue SerO (part of the purification 
tag) to Asp313 in both chains. The loop between residues Alalll and 
Thrll9 could not be modeled owing to weak electron density. One 
NADP + molecule and one ethylene glycol molecule were modelled in 
each chain. The nicotinamide ring of the cofactor is planar; hence, we 
assume that NADP + is bound to the protein in the oxidized form. 
The identity of a slightly prolate spheroid of electron density close to 
the nicotinamide ring of NADP + could not be established and was 
therefore modelled as 'unknown atoms' (UNK). A total of 611 water 
molecules were located. 



2.3. Data collection and structural determination 

Diffraction data were collected on the Berkeley Center for 
Structural Biology ALS 5.0.1 beamline as part of the Collaborative 
Crystallography program. The beamline uses a wavelength of 
0.9774 A and is equipped with an ADSC Quantum 210 CCD 
detector. The data were reduced in the monoclinic space group C2 to 
1.75 A resolution with XDSIXSCALE (Kabsch, 1988, 2010; see 
Table 1). 

Packing density (Matthews, 1968) suggested the presence of 
two molecules of aldose reductase in the asymmetric unit 
(V M = 1.58 A 3 Da" 1 , 52% solvent). A search of the PDB for sequence 
homology yielded a human aldose-reductase-like protein (AKR1B10) 
as the closest homolog with known structure (PDB entry lzua; 
Gallego et al, 2007), with 43% sequence identity. A search model was 
generated from molecule A of PDB entry lzua using the CCPA 
program CHAIN SAW (Stein, 2008; Winn et al, 2011). The structure 
was solved by molecular replacement with the CCPA program Phaser 
(McCoy et al, 2007); two molecules could be placed with high Z 



3. Results and discussion 

G. lamblia aldose reductase adopts a TIM-barrel fold with the 
NADP + -binding site located within the eight /3-strands of the interior 






Monomer A Monomer B 

Figure 1 

Dimer of aldose reductase from G. lamblia (GilaA.01452.aAl). Aldose reductase 
from G. lamblia forms a dimer of two TIM barrels. In this figure the twofold axis 
runs approximately vertically. The helices of the two monomers are shown as blue 
and purple ribbons. In each momomer, an NADP molecule is visible at the 
C-terminal end of the strands of the ^-barrel. This figure was prepared with 
CCPAmg (McNicholas et al, 2011). 




Figure 2 

(a) NADP is well defined in the structure of G. lamblia aldose reductase, (b) The 
adenine ring of NADP stacks hydrophobically with the side chains of His242 and 
Arg269. While Arg269 is conserved between human and Giardia aldose reductase, 
His242 of Giardia aldose reductase is an alanine in the human enzyme. This figure 
was prepared with CCP4mg (McNicholas et al, 2011). In both cases or A -weighted 
2|jP 0 | — |F C | electron density is shown in blue at la and cr A -weighted \F 0 \ — \F C \ 
electron density is shown in green and red at ±3cr. 
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(Fig. 1). Sequence alignment with Homo sapiens aldose reductase 
shows 44% identity; no structures in the PDB have an identity greater 
than 50%. The two monomers of G. lamblia aldose reductase super- 
impose with a root-mean-square deviation (r.m.s.d.) of 0.3 A (on C a 
atoms); a search of the PDB for structural homologues using SSMI 
PDBeFold (Krissinel & Henrick, 2004) yielded several structures 
of human aldose reductase with C*-atom r.m.s.d.s of 1.0-1.1 A. When 
compared with the structure of mammalian aldose reductase, subtle 
variations in structure are apparent. 

There is a distinct structural divergence of the N-termini of 
G. lamblia aldose reductase and the human protein: while the 
N-terminal 14 residues of both structures are well ordered, G. lamblia 
aldose reductase lacks the small N-terminal capping /3-sheet exhib- 
ited by mammalian structures (residues 2-14 of human aldose 
reductase). 

Cofactor binding by Arg269 is conserved between the two struc- 
tures. The side chain of His242 of G lamblia aldose reductase 
hydrophobically stacks with the adenine of NADP + , while this 
residue is an alanine in the human protein (Fig. 2). This histidine 
residue is not found in other aldose reductases of higher organisms 
and may be unique to Giardia. A flexible loop between the strand 
and helix 7 of the /3/a-barrel (dubbed the 'safety belt') has previously 
been reported to sequester solvent from the NADP + cofactor (Wilson 
et al, 1992). This flexible feature was not resolved in this structure. 
Tantalizingly, this region shows divergence between the mammalian 
and Giardia sequences. Sequence comparison with human aldose 
reductase by ClustalW (Larkin et al., 2007) analysis (Fig. 3) indicates 
that phenylalanine residues at positions 109 and 118 are conserved 
participants in forming the hydrophobic substrate-specificity pocket. 
Other residues identified as components of the mammalian specificity 
pocket (Klebe et al, 2004; Harrison et al., 1994; Sotriffer et al., 2004; 
Urzhumtsev et al., 1997) are divergent (Table 3). The residues of the 
anion-binding pocket and the active site appear to be completely 
conserved (Table 3) and their orientation is conserved in the three- 
dimensional structure. The residues flanking the C-terminal mobile 
region appear to be conservative substitutions, while the interior 
residues of this short region show no conservation (Table 3). 

The structure of the G. lamblia aldose reductase was superimposed 
with ligand-bound aldose reductases such as human aldose reductase 
in complex with fidarestat (PDB entry lef3; Oka et al., 2010) and 
porcine aldose reductase in complex with tolrestat (PDB entry lah3; 



Table 3 

Conservation of residues involved in substrate binding. 





H. sapiens 


G. lamblia 


Specificity pocket 


Thrll4 


Leul07 






A Iq^OzL 

/\iaj)U4 




Tyr310 


Pro310 


Anion-binding pocket 


Trp21 


Trpl2 




Val48 


Val39 




Tyr49 


Tyr40 




Hislll 


Hisl04 




Trpll2 


Trpl05 




Lys78 


Lys71 




Asp44 


Asp35 


C-terminal mobile region 


Val298 


Ile297 




Cys304 


Ala304 




Ala299 


Phe299 




Leu300 


Cys301 



Urzhumtsev et al, 1997) or sorbinil (PDB entry lahO; Urzhumtsev et 
al., 1997). All three inhibitors bind in close proximity to the 
nicotinamide group of NADP (Fig. 4). The protein models super- 
impose well overall, with r.m.s.d.s of 1.06 A (lef3), 1.03 A (lah3) and 
1.13 A (lahO). Despite high overall structural homology, there is a 
distinct difference in structure, separate from the mobile regions 
defined earlier, between the G. lamblia and mammalian enzymes 
close to the inhibitor-binding site. The C-terminal loop formed by 
residues Pro303-Leu311 diverges significantly from the structure of 
the corresponding loop in the human and porcine structures. In the 
human and porcine structures this loop is much more conserved and 
interacts with the inhibitors. In addition, structural rearrangements 
would be necessary for G. lamblia aldose reductase to harbor any of 
these three inhibitors. This structural diversity of the ligand-binding 
pocket in turn provides opportunities for the design of drugs that are 
specific for G. lamblia aldose reductase. 



4. Conclusion 

The structure of G. lamblia aldose reductase in complex with NADP + 
was solved to a resolution of 1.75 A. Residues in the active site and 
anion-binding pocket show conservation in sequence and structure 
with mammalian structures, while the nonphenylalanine residues 
involved in forming the specificity pocket show divergence. This 
structure did not resolve a flexible loop involved in solvent 



G. lamblia 

H. sapiens 



MQYPPRLGFGTWQAPPEAVQTAVETALMTGYRHIDCAYVYQNEEAIGRAFG 51 

MASRLLLNNGAKMPILGLGTWKSPPGQVTEAVKVAIDVGYRHIDCAHVYQNENEVGVAIQ 60 



G. lamblia 

H. sapiens 



KIFKDASSGIKREDVWITSKLWNYNHRPELVREQCKKTMSDLQVDYLDLFLVHWPLAFVR 111 
EKLREQV — VKREELFIVSKLWCTYHEKGLVKGACQKTLSDLKLDYLDLYLIHWPTGFKP 118 



G. lamblia 

H. sapiens 



NDVGDLFPKDAEGRAMLEKVPLADTWRAMEQLVEEGLVKHIGVSNYTVPLLADLLN — YA 169 
GKE — FFPLDESGNWPSDTNILDTWAAMEELVDEGLVKAIGISNFNHLQVEMILNKPGL 17 6 



G lamblia 
H. sapiens 



KIKPLVNQIEIHPWHPNDATVKFCLDNGIGVTAYSPMGG SYADPRDPSGTQKNVILE 226 

KYKPAVNQIECHPYLTQEKLIQYCQSKGIVVTAYSPLGSPDRPWAKPEDPS LLE 230 



* * -k-k-k 



G lamblia 
H. sapiens 



CKTLKAIADAKGTSPHCVALAWHVKKWNTSMYSVIPKSQTPARIEANFKCTEVQLSDDDM 28 6 
DPRIKAIAAKHNKTTAQVLIRFPMQRN LVVIPKSVTPERIAENFKVFDFELSSQDM 286 



G lamblia 
H. sapiens 



DAINNIHLNKRIRFCDPAI FWKVPLFD 313 

TTLLSYNRNWRVCALLSCTSHKDYPFHEEF 316 



Figure 3 

ClustalW (Larkin et al, 2007) comparison of the amino-acid sequences of H. sapiens and G. lamblia aldose reductase. 
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Figure 4 

Inhibitor-binding pocket of G. lamblia aldose reductase in comparison with ligand- 
bound human and porcine aldose reductases. G. lamblia aldose reductase (red 
model) superimposes with small r.m.s.d.s with human aldose reductase in complex 
with fidarestat (PDB entry lef3, dark blue) and porcine aldose reductase in 
complex with tolrestat (PDB entry lah3, green) and sorbinil (PDB entry lahO, 
cyan). Despite the overall structural homology, the C-terminal loop, which interacts 
with the ligand, shows a distinctly different trace. For clarity, the C atoms of the 
inhibitors follow the same color scheme as the corresponding ribbon. The NADP 
molecule is shown only for G. lamblia, with light green C atoms. This figure was 
prepared with CCP4mg (McNicholas et al, 2011). 

sequestration of NADP + . A conserved arginine residue is involved in 
cofactor binding together with an apparently Giardia-specific histi- 
dine residue. 
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